charade caption dataset
Country:
- Asia > China > Beijing > Beijing (0.06)
- Oceania > Australia > New South Wales > Sydney (0.05)
- North America > United States > New York > Monroe County > Rochester (0.05)
Supplementary Material for Multi-modal Dependency Tree for Video Captioning
The evaluation results on the Charades Captions dataset are shown in Table 2. Figure 1: Qualitative results of the generated tree structure and sentences on the MSR-VTT dataset. Table 1: The average sentence lengths of ground-truth captions and the captions generated by "w/o Lower average edit distance is better. In this section, we illustrate more details of human evaluation. We recruited 10 annotators to carry out the human evaluation process. The user interface for human evaluation is shown in Figure 4. To ensure Do the main claims made in the abstract and introduction accurately reflect the paper's Did you discuss any potential negative societal impacts of your work?
Country:
- Asia > China > Beijing > Beijing (0.06)
- Oceania > Australia > New South Wales > Sydney (0.05)
- North America > United States > New York > Monroe County > Rochester (0.05)
Country:
- Asia > China > Beijing > Beijing (0.04)
- North America > United States > New York > Monroe County > Rochester (0.04)
Technology:
- Information Technology > Artificial Intelligence > Natural Language (1.00)
- Information Technology > Artificial Intelligence > Vision (0.96)
- Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)